You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This pull request introduces several enhancements and bug fixes to the ValidMind Library, particularly focusing on credit risk scorecard modeling. The key changes include:
New Notebooks: Two new Jupyter notebooks have been added to demonstrate the application scorecard model using the ValidMind Library. These notebooks provide a step-by-step guide for loading a demo dataset, preprocessing data, training models, and documenting the model using ValidMind.
New Tests: Several new tests have been added to the validmind/tests directory, including:
MutualInformation: Evaluates feature relevance by calculating mutual information scores between features and the target variable.
ScoreBandDefaultRates: Analyzes default rates and population distribution across credit score bands.
CalibrationCurve: Assesses the calibration of probability estimates by comparing predicted probabilities against observed frequencies.
ClassifierThresholdOptimization: Analyzes and visualizes different threshold optimization methods for binary classification models.
ModelParameters: Extracts and displays model parameters for transparency and reproducibility.
ScoreProbabilityAlignment: Evaluates the alignment between credit scores and predicted probabilities.
Enhancements to Existing Tests: Modifications have been made to existing tests to improve their functionality and accuracy. For example, the TooManyZeroValues test now includes a row count and uses a percentage threshold for zero values.
Dataset Splitting Functionality: The split function in lending_club.py has been enhanced to support an optional validation set, allowing for more flexible dataset splitting.
Test Configuration Utility: A new utility function get_demo_test_config has been added to generate a default test configuration for demo purposes.
Version Update: The version of the ValidMind Library has been updated from 2.7.3 to 2.7.4.
Bug Fixes: Various bug fixes have been implemented, including corrections to test logic and improvements to test coverage.
Test Suggestions
Run the new Jupyter notebooks to ensure they execute without errors and produce the expected outputs.
Verify the functionality of the new tests by running them with different datasets and configurations.
Test the enhanced split function with various dataset sizes and configurations to ensure it correctly handles train, validation, and test splits.
Check the accuracy and performance of the MutualInformation and ScoreBandDefaultRates tests with known datasets.
Validate the CalibrationCurve and ClassifierThresholdOptimization tests by comparing their outputs with expected calibration and threshold optimization results.
Ensure the ModelParameters test correctly extracts parameters from different model types.
Test the ScoreProbabilityAlignment test with datasets having different score distributions.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Internal Notes for Reviewers
Add new application scorecard notebooks using ML with additional testing:
application_scorecard_with_ml.ipynb: running individual testsapplication_scorecard_full_suite: usingrun_documentation_tests()External Release Notes
Add new application scorecard notebooks using ML with additional testing:
application_scorecard_with_ml.ipynb: running individual testsapplication_scorecard_full_suite: usingrun_documentation_tests()